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ABSTRACT 

This is the fourth in a series of papers studying the astrophysics and cosmology of massive, 
dynamically relaxed galaxy clusters. Here, we use measurements of weak gravitational lens¬ 
ing from the Weighing the Giants project to calibrate Chandra X-ray measurements of total 
mass that rely on the assumption of hydrostatic equilibrium. This comparison of X-ray and 
lensing masses provides a measurement of the combined bias of X-ray hydrostatic masses 
due to both astrophysical and instrumental sources. Assuming a fixed cosmology, and within 
a characteristic radius (r 25 oo) determined from the X-ray data, we measure a lensing to X-ray 
mass ratio of 0.96 + 9%(stat) + 9%(sys). We find no significant trends of this ratio with mass, 
redshift or the morphological indicators used to select the sample. In accordance with pre¬ 
dictions from hydro simulations for the most massive, relaxed clusters, our results disfavor 
strong, tens-of-percent departures from hydrostatic equilibrium at these radii. In addition, we 
find a mean concentration of the sample measured from lensing data of C 200 = 3.0j;j g. Antici¬ 
pated short-term improvements in lensing systematics, and a modest expansion of the relaxed 
lensing sample, can easily increase the measurement precision by 30-50%, leading to simi¬ 
lar improvements in cosmological constraints that employ X-ray hydrostatic mass estimates, 
such as on from the cluster gas mass fraction. 

Key words: gravitational lensing: weak; galaxies:clusters:general; cosmology:observations; 
X-rays:galaxies:clusters 


1 INTRODUCTION 

Total masses of galaxy clusters are of central importance in 
multiple cosmological measurements, notably those involving the 
cluster baryon fraction and mass function (for a review, see e.g. 
Allen et al. 2011). Historically, much of this work has used mass 
estimates from X-ray data, ultimately relying on the assumption 
of hydrostatic equilibrium. In general, for a randomly selected 
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cluster and arbitrary measurement radius, simulations predict 
that departures from equilibrium will induce bias at the tens 
of per cent level between such estimates and the true cluster 
masses (Lau et al. 2009; Nagai et al. 2007). Restricting the 
analysis to intermediate measurement radii and the most massive, 
dynamically relaxed clusters reduces the expected bias and scatter 
considerably ( 10 per cent). The small intrinsic scatter seen 

in gas mass fractions, /gas , for such clusters (e.g.. Paper II of 
this series, Mantz et al. 2014) verifies that hydrostatic X-ray 
mass estimates trace the true mass well at these radii for these 
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clusters. However, some overall average bias may remain, whether 
due to residual non-thermal support or instrument calibration 
(Nelson et al. 2014; Nevalainen et al. 2010; Schellenberger et al. 2015). 
Determining this bias requires independent mass measurements 
for the same clusters using an unbiased method. 

Weak gravitational lensing (WL) provides a direct probe of 
the gravitational potential of a cluster, independent of haryonic 
physics. While WL mass estimates are inherently noisy due to clus¬ 
ter triaxiality and line-of-sight structure, they can in principle pro¬ 
vide unbiased mass estimates on average (Bartelmann & Schnei¬ 
der 2001; Schneider 2006). In practice, systematic effects must be 
carefully controlled, as demonstrated by the Weighing the Giants 
project (WtG; Applegate et al. 2014; Kelly et al. 2014; von der 
Linden et al. 2014a). But, with this in hand, WL can provide the 
necessary overall calibration of X-ray hydrostatic mass estimates. 

A growing body of work has used WL to either directly calibrate 
cosmological tests or to understand the biases in published stud¬ 
ies (Hoekstra et al. 2015; Israel et al. 2014; Mantz et al. 2015b; 
von der Linden et al. 2014b). In addition to enhancing cluster cos¬ 
mology directly, measurements of the lensing to X-ray mass ratio 
constrain the net effect of astrophysical and instrumental biases on 
the X-ray measurements*. 

Several groups have previously compared lensing masses to 
X-ray masses measured in various ways (e.g. Donahue et al. 2014; 
Israel et al. 2014; Mahdavi et al. 2013; Planck Collaboration et al. 
2013; Vikhlinin et al. 2009; Zhang et al. 2010). However, those 
efforts have used different X-ray telescopes, different calibration 
versions for those telescopes, varying sample selection, and vary¬ 
ing lensing and X-ray analysis methods. As a result, it is difficult to 
draw conclusions any broader than the relative calibration of each 
individual set of WL and X-ray mass estimates. This only under¬ 
scores the need for cluster mass estimation with strict tolerances on 
systematic uncertainties, which WL can, in principle, provide. 

This is the fourth in a series of papers dedicated to studying 
the cosmology and astrophysics enabled by X-ray and robust lens¬ 
ing observations of dynamically relaxed clusters. Paper I (Mantz 
et al. 2015c) details the morphological selection of dynamically re¬ 
laxed clusters from a sizable subset of cluster observations in the 
Chandra data archive^ and the X-ray data processing of the sam¬ 
ple. Paper II (Mantz et al. 2014) derives cosmological constraints 
from the sample, using measurements of gas mass fractions, while 
Paper III (Mantz et al. 2015a) studies the thermodynamics and scal¬ 
ing relations of these clusters. The current paper uses the 12 clusters 
in common between our relaxed sample and WtG to constrain the 
ratio of lensing to X-ray mass estimates, a calibration that was used 
for the cosmological measurements in Paper II, and that impacts 
directly on the constraints on the cosmic mean matter density, £2^. 

The paper is organized as follows. In Section 2, we introduce 
our sample and mass measurements. We calibrate our X-ray hy¬ 
drostatic masses with lensing in Section 3 and investigate the ro¬ 
bustness of the measurement in Section 4. We measure the average 
concentration for the relaxed cluster sample in Section 5. Finally, 
in Section 6, we provide physical explanations for our results and 
place them in context with other literature efforts. Additionally in 
Section 6, we discuss our blinding strategy and examine the cos¬ 
mology dependence of the lensing to X-ray mass ratio. Concluding 
remarks are presented in Section 7. 


' Unless otherwise explicitly stated, the term “X-ray masses” refer to hy¬ 
drostatic mass estimates. 

^ http://cxc.harvard.edu/cda 


Unless otherwise noted, all mass measurements assume a flat 
ACDM reference cosmology with £!„, = 0.3, Da = 0.7 and H(, = 
100/ikm/s/Mpc, where h = 0.7. 


2 OBSERVATIONS, MASS MEASUREMENTS, & 
STATISTICAL METHODS 

In this section, we describe how the sample was selected and review 
how the X-ray and lensing masses were measured. 

2.1 Sample Selection 

Our sample selection is motivated by the goal of measuring the 
lensing to X-ray mass ratio in as many massive, dynamically re¬ 
laxed clusters as possible, while maintaining robust control of sys¬ 
tematic uncertainties. As described in Paper I, we conducted a sys¬ 
tematic search for relaxed clusters from 361 cluster observations 
in the Chandra and ROSAT archives. The dynamical state of each 
cluster was evaluated using an automated morphological classifi¬ 
cation. Three discriminators were used to determine the morpho¬ 
logical state of a cluster: sharpness of the surface brightness peak, 
offsets in the centers of neighboring isophotes, and offsets of in¬ 
dividual isophotes from a global center. Surface brightness levels 
used for the isophote measurements were selected to probe equiv¬ 
alent physical scales, assuming clusters evolve following the self¬ 
similar model of Kaiser (Kaiser 1986; see also Santos et al. 2008). 
This procedure was designed to fairly compare clusters at varying 
redshifts and signal-to-noise levels, while also avoiding strong as¬ 
sumptions about cosmology (for example, the expansion history). 
The employed criteria correlate with traditional measures of mor¬ 
phological disturbance such as surface brightness “concentration” 
and centroid variance (see Paper I for details). Paper II imposed 
further requirements for the inclusion of targets in the cosmological 
analysis, namely a cut on the average gas temperature (kT > 5keV), 
as well as stricter data quality cuts. 

For the lensing comparison, we use the Weighing the Giants 
(WtG) sample (von der Linden et al. 2014a), which comprises 51 
X-ray selected clusters with Subaru-SuprimeCam follow-up data 
(Miyazaki et al. 2002). The clusters used in this analysis are listed 
in Table 1. We refer to the 12 clusters in common between WtG and 
the dynamically relaxed cluster sample above as the “relaxed WtG” 
sample in this paper. For our tests of morphology dependence of the 
lensing to X-ray ratio, we supplemented this sample with hve ad¬ 
ditional clusters common to Allen et al. (2008) and WtG that fail 
at least one morphological criteria of Paper I, which we call the 
“marginal” sample. By using the WtG analysis, we benefit from 
uniformly derived, accurate WL masses for all clusters in the com¬ 
parison. 

2.2 X-ray Hydrostatic Masses 

Our X-ray mass estimates are derived from Chandra data, using 
the analysis procedure described in Papers I and 11. However, we 
have used a newer version of the Chandra calibration, resulting 
in updated hydrostatic masses. For this paper, the data were pro¬ 
cessed with CIAO^ version 4.6.1 and CALDB"* version 4.6.2 fol- 

^ http://cxc.harvard.edu/ciao 
'* http://cxc.harvard.edu/caldb 
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Table 1. The sample of massive, dynamically relaxed clusters used in this work. Column [1] name; [2] adopted redshift; [3], [4] J2000 coordinates of adopted 
cluster center; [6] Chandra clean exposure time (ks); [7] Optical filter coverage; [8] Lensing Band (exposure in s, seeing). See Paper I for details of the X-ray 
data-set and processing; see von der Linden et al. (2014a) for details of the lensing data-set, filter definitions, and processing. 


Cluster 

z 

RA 

Dec 

X-ray Exposure 

Optical Filters 

Lensing Band 




Relaxed WtG Sample 



Abell 2204 

0.152 

16;32:47.1 

05:34:31.4 

89.4 

BiV,Rcg*r* 

Vj (1038, 0.58) 

RXJ2129.6±0005 

0.235 

21;29:39.9 

00:05:18.3 

36.8 

BjFjRcF 

Vj (1863,0.58) 

Abell 1835 

0.252 

14;01:01.9 

02:52:39.0 

205.3 

V,Ici^g*r* 

t (1944, 0.91) 

MS2137.3-2353 

0.313 

21;40:15.2 

-23:39:40.0 

63.2 

B,V,RcIcz* 

Rc (5562, 0.57) 

MACSJ1115.8±0129 

0.355 

11;15:51.9 

01:29:54.3 

45.3 

ViRcIc 

Rc (1944,0.65) 

RXJ1532.9±3021 

0.363 

15;32:53.8 

30:20:58.9 

102.2 

BjVsRcIcz+u* 

Rc (2106, 0.55) 

MACSJ1720.3±3536 

0.391 

17;20:16.8 

35:36:27.0 

53.2 

B,V,RcIcz* 

V, (1944, 0.69) 

MACSJ0429.6-0253 

0.399 

04;29:36.1 

-02:53:07.5 

19.3 

ViRcIc 

Rc (2592, 0.73) 

RXJ1347.5-1145 

0.451 

13;47:30.6 

-11:45:10.0 

67.3 

BiViRcIcz*u*g*r*i*z* 

Rc (2592, 0.69) 

MACSJ1621.6±3810 

0.461 

16;21:24.8 

38:10:09.0 

134.0 

BsVjRcIcz+u* 

Ic (1568, 0.52) 

MACSJ1427.3±4408 

0.487 

14;27:16.2 

44:07:31.0 

51.0 

V,Rcz^ 

Rc (2544, 0.59) 

MACSJ1423.8±2404 

0.539 

14;23:47.9 

24:04:42.3 

123.0 

BjVjRcIcz+u* 

Ic (1944, 0.73) 




Marginal Sample 



A963 

0.206 

10:17:03.562 

39:02:51.51 

38.3 

ViRcIc 

Ic (2700, 0.61) 

A2390 

0.233 

21:53:37.070 

17:41:45.39 

79.4 

BiViRcIci*z*u* 

Rc (3420,0.56) 

A611 

0.288 

08:00:56.818 

36:03:25.52 

30 

BiVjRcIcg^r^ 

Ic (1896, 0.62) 

MACSJ0329.6-0211 

0.450 

03:29:41.459 

-02:11:45.52 

22.2 

BjTjRc/cz+m* 

Vj (1944, 0.55) 

MACSJ0744.8±3927 

0.698 

07:44:52.310 

39:27:26.80 

73.2 

BjFjRc/cFz+m* 

Rc (4869,0.56) 


lowing standard procedures.^ The intracluster medium was mod¬ 
eled as a spherically symmetric, piece-wise isothermal atmosphere 
in hydrostatic equilibrium with a gravitational potential given by 
the Navarro-Frenk-White (NFW; Navarro et al. 1997) model. The 
center for this deprojection was chosen to maximize the symme¬ 
try of the emission at radii ~ r 2 soo, where the mass profiles are 
best constrained (and where we perform the comparison to lens¬ 
ing masses). The radius r 25 oo is where the mean enclosed density is 
2500 times the critical density of the universe at the cluster’s red- 
shift, as determined by the X-ray observations. Appropriate fore¬ 
ground and background components were included in the spectral 
analysis, along with a model for the cluster emission. The model 
was fitted to the photon counts in the 0.6-7.0 keV band using the 
modified C-statistic in XSPEC (Arnaud 1996) in energy bins, de¬ 
fined to have at least one count per bin. See Paper II for more de¬ 
tails. 

2.3 Weak Lensing Masses 

The lensing analysis for this work is similar to that presented in 
the Weighing the Giants papers (Applegate et al. 2014; Kelly et al. 
2014; von der Linden et al. 2014a). Here, we summarize the essen¬ 
tial elements of the mass measurement process that are referenced 
later in the text, and highlight all small changes in the modeling 
procedure. 

WL masses were derived from multi-filter observations with 
SuprimeCam at Subaru and Megacam at CFHT. We generated 
shear catalogs for each cluster field using the KSB-t- (Hoekstra 
et al. 1998; Kaiser et al. 1995) moments-based code analyseldac, 
described in Erben et al. (2001). Absolute shear calibration was 
determined from the STEP2 simulations, which were designed to 
replicate SuprimeCam observations (Massey et al. 2007). The clus¬ 
ter shear profiles were modeled as NFW halos over a cluster-centric 
radial range 750kpc-3Mpc. We adopted the X-ray center when 
constructing the ID average shear profiles. Note that the restriction 
of shear measurements to relatively large cluster radii makes our 

^ http://cxc.harvard.edu/ciao/guides/acis_data.html 


mass estimates insensitive to the exact choice of center (von der 
Linden et al. 2014a). 

Two different methods were used to measure the redshift dis¬ 
tribution of galaxies selected as lensed sources: the “color-cut” 
method and the F (z) method. Both methods are described in Ap¬ 
plegate et al. (2014), where we demonstrated that the F(z) method 
is unbiased and that our implementation of the “color-cut” method 
yields consistent results. We briefly describe each method below. 
For the analysis of Section 3, we adopted the color-cut masses 
cross-calibrated with P (z) masses as our baseline, thus maximiz¬ 
ing the sample size (12 clusters versus 6 with the F(z) method) 
while maintaining uniform systematic uncertainties. 

For the “color-cuf’ method, we matched the galaxy popula¬ 
tion of each cluster field to the COSMOS deep field (Ilbert et al. 
2009), for which high quality 30-1- filter photometric redshifts are 
available. To combat signal dilution by contaminating cluster mem¬ 
bers, we removed the red sequence for each cluster. We then per¬ 
formed a “contamination correction”, where the measured signal 
was boosted using the average excess galaxy number counts ob¬ 
served in the WtG sample. 

For the F (z) method, we calculated photometric redshift prob¬ 
ability functions for individual galaxies from five filter Bj VjRctcZ^ 
photometry (Kelly et al. 2014). We measured the NFW halo prop¬ 
erties of the cluster with an unbinned Bayesian model fitted to 
the measured shear at each galaxy position. During the fit, we 
marginalized over the redshift probability function for each galaxy. 
P (z) masses are used in this work to test the robustness of the lens¬ 
ing to X-ray mass ratio measurement and to measure the average 
concentration of the sample. 

We adopted a prior on the NFW concentration from Neto 
et al. (2007), using results for relaxed clusters from their largest 
mass bin, logjo M 2 oo^l'Mo = 14.875-15.125. The prior probabil¬ 
ity F(log]Q c) was modelled as a Gaussian distribution with mean 
0.664 and standard deviation 0.061. Note that this differs slightly 
from Applegate et al. (2014), for which the analysis was not re¬ 
stricted to dynamically relaxed clusters, and where a Gaussian prior 
of logjQC = 0.6 ±0.116 representing clusters in all dynamic states, 
was used. 
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Previous work has shown that modelling a sample of clus¬ 
ter lensing observations as NFW halos following a fixed mass- 
concentration relation can lead to small biases in the average 
mass of the sample (Bahe et al. 2012; Becker & Kravtsov 2011). 
This mass bias can be traced to the use of an incorrect mass- 
concentration relation and to deviations of the cluster density pro¬ 
file from NFW at radii beyond the cluster virial radius. To correct 
both of these effects, we replicate our mass modeling procedure on 
mock lensing observations of the Millennium-XXL simulation (S. 
Flilbert et al. in prep; see also Angulo et al. 2012). Millennium- 
XXL features 303 billion particles in a 4.1 Gpc box, providing 
over one hundred halos in a mass range matching the clusters in 
this study. We replicated our analysis on two simulation snapshots 
where mock lensing observations were available, z = 0.25 and 
z= 1.0. 

For this study, we find that a correction factor constant in mass 
is sufficient to characterize the bias in M 25 oo- At z = 0.25, we find a 
correction factor of 1.03±0.01 (stat only), increasing our masses by 
« 3%. At z = 1.0, we find a correction factor of 1.10 ± 0.02. We lin¬ 
early interpolate the correction to each cluster redshift in our sam¬ 
ple and marginalize over the statistical uncertainty when fitting for 
the lensing to X-ray mass ratio. We discuss additional systematics 
associated with this simulation-based correction in Section 4.3.2. 
A complete discussion of how we derive this correction and the re¬ 
lated systematic error estimates will be presented in Applegate et 
al., in prep. Note that this simulation correction is new since the 
publication of Paper II in the series. 

In Applegate et al. (2014), we discussed the relevant system¬ 
atic uncertainties in measuring the mean lensing mass of the full 
51 cluster sample. We attributed a 4% systematic uncertainty to our 
shear measurements. The determination of the photometric-redshift 
distribution contributed an additional 3% systematic uncertainty. 
We estimated a 4% uncertainty in the calibration of the color-cut 
lensing masses by P{z) masses. The total systematic from these 
three sources is » 6%. We had previously included a 3% system¬ 
atic uncertainty attributed to modeling the mass distribution in the 
clusters. We update the mass model uncertainty estimate, in light 
of the new simulation-based correction, in Section 4.3.2. 

Triaxiality of cluster halos and line-of-sight structure con¬ 
tribute additional noise to lensing measurements, estimated to be » 
20% (Becker & Kravtsov 2011; Hoekstra 2003). The Millennium- 
XXL mock observations allow us to estimate the intrinsic scatter 
from the matter distribution within 100 h^'Mpc (physical) of the 
cluster center. We model the intrinsic scatter in the simulations as a 
log-normal distribution, finding cri„, = 0.23 ± 0.01 at z = 0.25 and 
(Tint = 0.28 ± 0.01 at z = 1.0. However, a 100 h^'Mpc integrated 
length does not account for all scatter induced by line of sight struc¬ 
ture. While we could follow the method of Hoekstra (2003) to es¬ 
timate the scatter component induced by large scale structure, we 
elect to simply fit for the total magnitude of intrinsic scatter when 
modeling the lensing to X-ray mass ratio. 

2.4 Statistical Methods 

The X-ray to lensing mass ratio is poorly determined by a simple 
averaging procedure. First, lensing masses genetically have large 
uncertainties that are non-Gaussian (nor log-normal) in shape, even 
for the most massive clusters with the best data. Second, the uncer¬ 
tainties in the lensing masses are correlated with the X-ray masses 
when measured within an aperture determined from the X-ray data. 
Neglecting this correlation will artificially boost the apparent pre¬ 
cision of the measurement. Third, we expect an approximately log¬ 


normal intrinsic scatter between X-ray and lensing masses, since 
the lensing masses are sensitive to cluster triaxiality and line of 
sight structure (Bahe et al. 2012; Becker Sc Kravtsov 2011). This 
scatter is of a comparable magnitude to the WL measurement un¬ 
certainties, so neither can be safely neglected. 

While none of these issues are new, their combination limits 
the applicability of standard approaches. Fitting packages such as 
LiNMix_ERR from Kelly (2007) or BCES from Akritas <& Bershady 
(1996) assume that the forms of the measurement error and the 
intrinsic scatter are the same (Gaussian). Both codes also restrict 
users to a simple two-parameter linear model. 

To avoid these pitfalls, we can instead write down the posterior 
probability distribution for the lensing to X-ray mass ratio without 
approximation. Specifically, we can incorporate the exact probabil¬ 
ity distributions for NFW halo parameters that describe the lensing 
and X-ray data sets. This correctly incorporates both detections and 
non-detections into the ratio, while also allowing us to directly ac¬ 
count for the correlation between lensing and X-ray masses. We 
model the intrinsic scatter from triaxiality and line of sight struc¬ 
ture as a log-normal distribution, independent of the form of the 
measurement errors. Appendix A explicitly presents the posterior 
probability function for the lensing to X-ray mass ratio. 

This basic ratio model can be easily extended to generalize 
the relationship between lensing and X-ray masses or to linearly 
regress the ratio against other variables. When the independent vari¬ 
ables in these fits have non-negligible uncertainty, for example the 
cluster morphology discriminators we consider in Section 4.2, we 
must also include a prior on the true, unobserved distribution of 
that variable. For this task, we adopt the hierarchical mixture of 
Gaussians prior employed in Kelly (2007), with uniform priors on 
Gaussian scale parameters {i.e. cr) following Gelman (2006). By 
default, we report results for mixtures of two Gaussians, but have 
verified that results are not sensitive to the exact number employed. 

Throughout the paper, we adopt uninformative priors when 
possible. For example, we adopt a uniform prior on the log of 
lensing to X-ray mass ratio, which equally prefers ratios greater 
or less than one. We also adopt uniform priors on intrinsic scat¬ 
ter cTin,. We quote maximum a posteriori parameter point estimates, 
transforming probability distributions and point estimators appro¬ 
priately when log transforms are applied to variables. 

For the majority of this paper, we consider the lensing to X- 
ray mass ratio at a fixed cosmology. However, both the lensing and 
X-ray masses inferred from the data implicitly depend on the cos¬ 
mological model. We comment on the resulting dependence of the 
lensing to X-ray mass ratio on cosmological parameters in Sec¬ 
tion 6.3. When using the lensing data to calibrate cluster masses 
in a cosmological test, such as the gas-mass fraction ) con¬ 
straints of Paper II, it is best to directly incorporate the full model 
described above; for each trial cosmology, model parameters are in¬ 
ferred from the measured shear profiles (as a function of angle) and 
galaxy redshift distributions. In the case of Paper II (see also Allen 
et al. 2008), the model for the X-ray data includes a parameterized 
redshift-dependent scaling, K(z) = Ko(l + Ki z), encoding the aver¬ 
age ratio of true cluster masses to X-ray mass estimates at a given 
cluster redshift. Given the expectation that our lensing masses are 
unbiased within a 7% tolerance, we can interpret this function as 
the lensing to X-ray mass ratio, and use the likelihood associated 
with the lensing data (Appendix A) to constrain the model for K(z) 
(in practice, the current data can constrain Kq, but K[ must be con¬ 
strained by a prior). For simplicity, the implementation in Paper II 
neglects the cosmology dependence of the mass-concentration re¬ 
lation as well as the sub-dominant uncertainties from the contami- 


MNRAS 000, 1-13 (2015) 



Chandra Hydrostatic Mass Calibration 5 


1.6 


§ 1.4 



O' 0.6 

'iJi 

c 

01 0.5 


02 03 04 oF 

Cluster Redshift 


Figure 1. Measured weak lensing to Chandra X-ray mass ratio plotted 
against cluster redshift for the relaxed WtG sample of clusters. Lensing 
masses are measured within the X-ray defined r 25 oo and use the color-cut 
analysis. Data points are for individual clusters, where en'or bars represent 
68% confidence infervals. The dashed line and orange bands indicate the 
ensemble mass ratio, 68% and 95% ranges. The ratio and statistical 68% 
range is 0.967;';jj or 0.956 ± 0.082 when approximated as a Gaussian 
distribution. Additional systematic uncertainties of 9% apply, for a total un¬ 
certainty of « 12%. The y-axis and associated probability distributions are 
log transformed. 


nation correction and the lensed galaxy redshift distribution (char¬ 
acterized hy (yS) and (y6^)); due to the small size of the comparison 
sample, these effects are much smaller than the statistical uncer¬ 
tainties. The same approach was also used in Mantz et al. (2015b). 


3 WEAK LENSING TO CHANDRA X-RAY RATIO 

In this section we present our measurements of the lensing to Chan¬ 
dra X-ray mass ratio for our sample of 12 relaxed WtG clusters. We 
first present results for the ratio measured within r 2 soo, where this 
radius is determined from the X-ray data. This is the appropriate 
ratio to use in the /gas model described in Paper II. We then present 
results for the mass ratio M 2500 /M 2 S 00 ’ "'here each data set indepen¬ 
dently determines its own value of r 25 oo- Finally, we also report the 
WL to X-ray mass ratio measured within the X-ray defined rsoo- 

We first consider the mass ratio within the X-ray defined r 25 oo, 
as plotted in Figure 1. We measure a mass ratio of M^IMx - 
0.967^gg®2, where the quoted uncertainties are the statistical 68% 
confidence interval after marginalizing over the intrinsic scatter. 
The best Gaussian approximation of the posterior PDF is MijMx - 
0.956 ± 0.082. From the statistical uncertainties alone, the ratio is 
fully consistent with unity. 

The measurement is also subject to systematic uncertainties 
from the WtG WL analysis (Section 2.3). Whereas the WtG anal¬ 
ysis of Applegate et al. (2014) measured masses within 1.5 Mpc, 
the current measurement is at r 25 oo, typically a factor of ~ 3 
smaller. This requires us to extrapolate our shear measurements 
inwards, thereby increasing our sensitivity to priors on the mass- 
concentration relation. At this radius, the mass-concentration re¬ 
lation contributes an additional 6% systematic uncertainty, which 
we discuss in Section 4.3.2. Therefore, the total WL systematic un¬ 


certainty is 9%, resulting in a combined statistical and systematic 
uncertainty for the mass ratio measurement of 12%. 

We measure an intrinsic scatter of crint = 0-146^Q gjg present in 
the lensing to X-ray ratio. This amount of intrinsic scatter is consis¬ 
tent at Icr with the intrinsic scatter predicted by the MXXL simula¬ 
tions and with the intrinsic scatter detected for the full WtG cluster 
sample (Mantz et al. 2015b). 

Next, we measure the ratio within r 25 oo apertures determined 
independently from the lensing and X-ray data. This alternative ra¬ 
tio is relevant for calibrating mass proxies in some circumstances.® 
These results should be noisier, as WL cannot easily measure r 25 oo- 
The median mass ratio and 68% statistical uncertainty is 0.90j;Q|2, 
with a measurement of intrinsic scatter of cri„, = 0.25);j5 [, The same 
9% systematic uncertainty applies, resulting in a total measurement 
precision of 16%. 

Finally, we measure the ratio within the X-ray defined rjoo. 
Note that this is in some cases an extrapolation of the X-ray data 
to larger radii, but is well measured by the WL observations. We 
measure a WL to X-ray ratio of 1.059^® when using a WL cor¬ 
rection factor derived from the Millennium-XXL simulations that 
is appropriate for rsoo- Again, a total WL systematic uncertainty of 
9% applies. 


4 DISCUSSION OF SYSTEMATICS 

We investigate the possible influence of a number of systematics 
that in principle could alter the measured mass ratio. We start with 
the adopted Chandra temperature calibration. We then verify the 
robustness of the measurement to the definition of the cluster sam¬ 
ple and the details of the lensing analysis. Finally, we search for 
trends in the X-ray to lensing ratio that correlate with cluster mass 
or redshift. 

4.1 Chandra Calibration 

X-ray hydrostatic masses, and therefore the lensing to X-ray mass 
ratio, are dependent on the effective area calibration adopted for 
Chandra. While Chandra calibration updates usually refine the 
most recent observations, some retroactively affect temperatures 
measured from older data. One such update happened after the pub¬ 
lication of Paper II of this series. To see how large an effect these 
calibration changes have on our measurements, we compared our 
measurements made above (using CIAO version 4.6.1 and CALDB 
version 4.6.2) with the calibration from May 2012 (CALDB ver¬ 
sion 4.4.10 and CIAO version 4.4) used in Paper 11. 

When X-ray mass estimates were compared between the two 
calibration versions for the 40 clusters in the full relaxed sample 
from Paper I, we saw that the newer calibration (the default used 
in this paper) lowers the average X-ray mass, M 2500 , by 5%, with 
a scatter of 5%. Restricting to only the 12 relaxed WtG clusters in 
the calibration sample examined in this paper, we see a downward 
shift of only 3%. The small shift in X-ray masses is reflected in 
an equally small shift in the lensing to X-ray mass ratio. With the 
May 2012 calibration, the lensing to X-ray mass ratio at the X-ray 
measured r 2 soo is 0.940^® °“, in comparison to our baseline results 
of 0 967+“ 

U.yU / g gg2- 

® We reiterate, however, that the result applies only for our reference cos¬ 
mology, and that in applications where cosmological parameters are al¬ 
lowed to vary, the only robust approach is to fit the mass ratio model si¬ 
multaneously with the cosmology. 
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This shows that our results are not overly sensitive to recent 
changes in the Chandra effective area calibration. More impor¬ 
tantly, the use of lensing data to calibrate the X-ray masses in cos¬ 
mological tests like that of Paper II corrects for any overall mass 
bias potentially introduced by the Chandra calibration. 

4.2 Sample Selection 

In Papers I and II, clusters were included in the cosmological anal¬ 
ysis with the /gas test based on X-ray morphology indicators. Any 
detected trend with morphology would indicate that the lensing to 
X-ray mass ratio, and therefore the /gas test, is sensitive to the se¬ 
lection criteria. In this section, we verify that no trend exists with 
any of these indicators. 

Since by construction all clusters in the relaxed WtG sample 
qualify as relaxed, we extended our sample with five additional 
clusters that failed one or more morphology criteria in Paper I (the 
“marginal” sample). Each of the five clusters have previously been 
included in subjective selections of relaxed clusters (Allen et al. 
2008; Donahue et al. 2014). 

As a first check, we compared the measured lensing to X-ray 
mass ratio for the relaxed WtG sample against the marginal sample. 
Using the five clusters in the marginal sample alone, the measured 
ratio within the X-ray measured r 2 soo is = 1.05^® 23 - In ad¬ 
dition, we detect an intrinsic scatter of cTin, = The ratio is 

consistent with the twelve clusters from the relaxed WtG sample, 
albeit with low precision. When all seventeen clusters are consid¬ 
ered, the ratio is M^/Mx - with an intrinsic scatter of 

(Tint = 0 - 22 ^Qg 25 , again consistent with the results of the relaxed- 
only sample. 

We also investigated linear trends in the lensing to X-ray mass 
ratio with each of the morphology measures used to define the sam¬ 
ple (see Paper I). The mass ratio is plotted against symmetry, peak- 
iness, and alignment in figure 2. To investigate trends in these pa¬ 
rameters, we adopt a linear model between each parameter and the 
log mass ratio, ta = a + f}X (see Section 2.4). 

We list the ID marginalized uncertainties for the slope and in¬ 
trinsic scatter for each morphology measure and both samples in 
table 2. When considering the 12 clusters in our relaxed WtG sam¬ 
ple, both Symmetry and Alignment exhibit an « 1(T preference for 
a non-zero slope. However, no trend is evident when the combined 
“relaxed WtG” and “marginal” samples are considered. We note 
that the intrinsic scatter appears to increase for clusters closer to 
the multidimensional selection boundary, as seen in Figure 2. 

From these tests, we conclude that our results on the WL to X- 
ray mass ratio are not overly sensitive to the exact morphological 
selection criteria used to select the sample. 

4.3 Lensing Analysis 

Weighing the Giants lensing masses measured within an aperture of 
at 1.5 Mpc should be unbiased within a 7% systematic uncertainty 
(Applegate et al. 2014). However, since the measurements here are 
at the smaller radius of » ^2500 and employ a new simulation-based 
correction factor, the systematic uncertainties of two components 
of the lensing analysis are worth reviewing. First, we explore how 
the lensing to X-ray mass ratio varies when we change how we cal¬ 
culate the background redshift distribution of lensed sources and 
correct for cluster galaxy contamination. Then, we quantify the sys¬ 
tematic uncertainty from the mass model and the simulation correc¬ 
tion. 


4.3.1 Lensed Source Redshift Distribution 

The Weighing the Giants project employed two methods to esti¬ 
mate the redshift distribution of lensed sources and address sig¬ 
nal dilution by cluster galaxies, the color-cut method and the P (z) 
method. The color-cut method statistically matches the galaxy pop¬ 
ulation in the cluster field with a reference deep field and corrects 
the shear profile for the diluting effects of cluster galaxies. In con¬ 
trast, the P (z) method estimates a redshift probability function for 
each galaxy in the cluster field and only includes galaxies that are 
believed to be in the background of the galaxy cluster. We used 
the color-cut method in this study to maximize our sample size 
while maintaining uniform systematic uncertainties. We previously 
showed that our mass measurements using the P (z) method were 
consistent with our implementation of the color-cut method used 
in this analysis (Applegate et al. 2014). Here we check if using the 
alternative P (z) method produces consistent results for the six clus¬ 
ters in the relaxed WtG sample where a comparison is possible. 

Figure 3 shows the constraints on the lensing to X-ray mass 
ratio within the X-ray measured r 25 oo for the two sets of clusters in 
the analysis, those with and without P (z) mass measurements. For 
the clusters with P (z) mass measurements, we calculated the lens¬ 
ing to X-ray mass ratio twice, once with the P (z) method and once 
with the color-cut method. Both analyses are statistically consistent 
with the independent set of color-cut only clusters. 

For the six clusters where both the color-cut and P (z) methods 
can be applied, the measured ratios differ by 11%. Scatter between 
the methods is expected: each method selects a somewhat different 
population of lensed galaxies for analysis, resulting in » 20% scat¬ 
ter between these measurements at z « 0.2, growing to « 40% at 
z « 0.6 (Applegate et al. 2014). To evaluate the statistical signifi¬ 
cance of the observed offset between methods for the six clusters in 
our sample, we created random samples of six clusters from the set 
of 27 clusters with P{z) measurements in Applegate et al. (2014). 
The distribution of offsets between P (z) method masses and color- 
cut method masses for these subsamples is consistent with zero and 
has a population standard deviation of cr = 16%. The offset de¬ 
tected between analyses for these six clusters is therefore within 
expectations. 


4.3.2 Lensing Mass Model 

For this study, we have used the NFW density profile to model the 
observed shear signal of galaxy clusters. We have also assumed a 
mass-concentration relation, both to reduce the nonlinearity of the 
model and to compensate for the inability of the lensing data to 
constrain the density profile shape. However, studies of mock lens¬ 
ing observations of N-body simulations have shown that modeling 
a population of realistic clusters as isolated NFW halos can lead to 
small biases in the average mass of the sample (Babe et al. 2012; 
Becker &. Kravtsov 2011). Lensing masses may also be sensitive 
to the choice of mass-concentration relation, either when modeling 
shear information at small cluster-centric radii or, as in this work, 
when extrapolating mass measurements to small radii. 

To correct for these effects, we replicated our lensing mea¬ 
surement procedure on mock observations of the Millennium-XXL 
(MXXL) N-body simulation and introduced a correction factor (see 
Section 2.3). While the statistical uncertainty of the correction fac¬ 
tor is small (less than 2%), additional systematic uncertainties arise 
from mismatches between the simulation and reality. The mass- 
concentration relation introduces the largest systematic uncertainty. 
The simulation correction factor effectively fixes our choice of 
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Figure 2. The sample ratio (black dotted), 68% and 95% confidence intervals (orange, shaded), and ratios for individual clusters, plotted against three X-ray 
morphology measurements: (left) symmetry, (center) peakiness, and (right) alignment. The vertical dashed red lines show the thresholds that define the relaxed 
region. Black data points are the relaxed WtG sample, while blue data points are the marginal sample. 


Table 2. Measured slopes and offsets for the best fitting linear relationship between the lensing to X-ray mass ratio versus three morphology measures, for the 
relaxed WtG and marginal samples. Quoted are the ID 68% marginalized uncertainties for each quantity. 



Relaxed WtG Sample 

Relaxed WtG + Marginal Sample 

Morphology Measure 

Slope 

Offset 

Pivot 

Slope 

Offset 

Pivot 

Symmetry 

24 

0 94+^ u / 

1.18 

0 07+O 25 
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-0.602 

-0.53 

-0.66 
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Figure 3. Results for three subsets of six clusters, using both the color-cut 
analysis and the P (z) analysis. Data points with error bars show 68% con¬ 
fidence intervals for each cluster subset, while the dark orange and light 
yellow shaded regions are the 68% and 95% confidence intervals for the 
full sample. Groups one and two are identical clusters, but analyzed using 
the P (z) and color-cut methods, respectively. Group 3 comprises six inde¬ 
pendent clusters where only the color-cut analysis can be applied. Groups 
1 and 2 are statistically consistent with group 3 at the Icr level. Groups 1 
and 2 are also statistically consistent when the covariance between the two 
measurement methods is fully accounted for. Note that the correction for 
biases in the mass model has not been applied here. 


mass-concentration relation, on average altering the masses to what 
we would have measured using the mass-concentration relation re¬ 
alized in the MXXL. However, a residual bias may remain due 
to the cosmology dependence of the mass-concentration relation 
(Diemer & Kravtsov 2015; Ludlow et al. 2013). We quantify our 
sensitivity to the mass-concentration relation by varying the peak 
of our prior on concentration by ±20%. This range is motivated by 


Bhattacharya et al. (2013), who measured differences in the mass- 
concentration relation amplitude of « 15% between their results 
and Duffy et al. (2008), as well as between results based on the 
Millennium Simulation (Gao et al. 2008; Hayashi & White 2008; 
Neto et al. 2007). The measured WL to X-ray mass ratio at r 25 oo 
varied by ±6%. 

The simulation correction derived from MXXL is based on a 
mass-limited selection of halos. In contrast, the morphology selec¬ 
tion from Paper I should select clusters that have less substructure 
than the general population, yet has not significantly altered the 
ellipticity distribution of the clusters Mantz et al. (2015c). This 
should on average decrease the magnitude of observed intrinsic 
scatter but not alter the correction factor (Bahe et al. 2012). Based 
on the measured ellipticity distribution of the relaxed sample from 
Paper I, we also consider it unlikely that a conspiracy exists be¬ 
tween a preferential selection of clusters projected along the line of 
sight, leading to overestimated lensing masses, and a biased Chan¬ 
dra temperature calibration, leading to overestimated X-ray masses. 
Future work is required to replicate the Paper I selection function 
in cosmological simulations. 

In principle, lensing mass measurements can be sensitive to 
the choice of center used to measure the shear profile. However, 
we have previously shown that our shear measurements are mini¬ 
mally affected by the choice of cluster center (von der Linden et al. 
2014a). We have also studied the effects of shear profile miscen- 
tering by introducing random offsets into the MXXL mock lens¬ 
ing observations. When we perturb the assumed center following 
a probability distribution of BCG-X-ray offsets observed in Chan¬ 
dra archive observations of clusters (Paper I), we see a sensitiv¬ 
ity in the correction factor of only x: 3%. In addition, the relaxed 
WtG sample shows considerably reduced BCG-X-ray offsets than 
the general population, a further indication that these clusters are 
indeed dynamically relaxed (Paper I). We therefore expect system- 
atics from miscentering to be negligible for this sample. 

We have used two simulated redshifts from the MXXL to mea¬ 
sure our correction factor, at z = 0.25 and z = 1.0, and have lin¬ 
early interpolated in redshift between the measurements. However, 
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Figure 4. Lensing mass versus X-ray mass. The dark and light blue bands 
are the 68% and 95% confidence intervals allowed by the power-law model 
fit to the data. The dashed black line is the one-to-one line. The magnitude 
of the intrinsic scatter has been marginalized over. 

the bulk of clusters in the sample are at low redshift. To test the 
sensitivity of our measurements to the details of the interpolation 
scheme, we remeasured the lensing to X-ray ratio using only the 
correction factor from the z = 0.25 simulation. In this case, we 
measure the ratio to be 0.940^gg®.J, where the best Gaussian ap¬ 
proximation to the posterior PDF is yu = 0.948 ± 0.082, a shift of 
less than 2%. 

4.4 Mass or Redshift Trend 

In this section, we search for trends in the X-ray to lensing ratio 
with mass or redshift. Such trends would suggest that a more com¬ 
plex calibration model is needed for the /gas experiment. Previous 
work has suggested that a trend with mass may exist for hydro¬ 
static masses from Chandra, albeit at low statistical significance 
and for samples including all dynamic states (Israel et al. 2014). In 
our case, we emphasize that the baseline ratio model is an adequate 
description of the data, and that the small size of our sample makes 
precise measurements of extended models difficult. 

We first examine if the lensing to X-ray mass ratio exhibits 
a mass dependence. To do this, we extended the ratio model from 
Section 3 to include a power-law scaling between lensing and X- 
ray masses (again including intrinsic scatter), and model the in¬ 
trinsic distribution of X-ray masses as a sum of Gaussians (see 
Section 2.4). We used the mean X-ray mass as the pivot point for 
the power-law. A power law index of / = 1 indicates no mass de¬ 
pendence and reduces to the original ratio model. Figure 4 shows 
the inferred power-law. The statistical 68% confidence interval for 
the power law index, with all other variables marginalized over, is 
j8 = 0.78^55 The power law index/ is less than unity at less than 
2cr significance. We therefore find no statistical evidence for a trend 
with mass. ^ 

^ It is an interesting to note that when we consider the “relaxed WtG” plus 
the marginal cluster samples, the power law index /J is measured to be /? = 
0.69 ± 0. 13, which is less than unity at more than Icr significance. However, 
it is hard to interpret this measurement without a more robustly defined 
sample. 


We next check for an explicit redshift dependence in the lens¬ 
ing to X-ray ratio. We extended the ratio model from Section 3 to 
include a term linear in cluster redshift, such that In M^sool^ioo ~ 
a + Pz, using the mean redshift as the pivot point in the fit. We 
measured a best fit slope of -O.SO^g gj, i.e. a decreasing mass ra¬ 
tio towards higher redshifts. However, the results are statistically 
consistent with a constant value at approximately Icr. 


5 AVERAGE CONCENTRATION OE MASSIVE, 
RELAXED CLUSTERS 

The mass-concentration relation for massive halos influences 
the measured lensing to X-ray mass ratio (as discussed in Sec¬ 
tion 4.3.2). It also reflects the formation history of halos and carries 
some weak information on cosmology (Ludlow et al. 2013; Wech- 
sler et al. 2002). By design, the WtG mass measurements are mini¬ 
mally sensitive to assumptions about the cluster concentration (Sec¬ 
tion 4.3.2) However, some partial sensitivity is still present, which 
we use here to measure the average concentration of the massive, 
relaxed clusters in our sample. 

As opposed to the rest of this study, for this exercise we jointly 
fit the concentration of six clusters with P{z) measurements. We 
limit ourselves to the P{z) clusters to avoid systematic uncertain¬ 
ties from the “contamination correction” required in the color cut 
analysis (see Applegate et al. 2014). The contamination correction 
alters the slope of the shear profile, and is therefore degenerate with 
the measured concentration. We also remove the prior on concen¬ 
tration from Section 2.3. 

Since we designed our measurements to be insensitive to con¬ 
centration, we do not expect to measure the concentration of indi¬ 
vidual clusters with any useful precision. In addition, we expect ad¬ 
ditional noise from triaxiality and large-scale structure (Bahe et al. 
2012). We therefore simultaneously model the population of clus¬ 
ters to measure an average concentration. We assume that the con¬ 
centrations of clusters will have a log-normal scatter around the 
average concentration. Specifically, our model consists of a mass 
(M{< 1.5Mpc)) and concentration for each cluster, the mean of 
the log-normal distribution of concentrations, yu,,, and the scatter in 
concentration measurements, cTc. We assume flat priors on and 
<Tc, and restrict to the range [0.286, 0.318], based on the largest 
mass bin reported in the simulation results of Bahe et al. (2012). 

We measure the average concentration to be = 3.0];|g. 
While hardly discriminatory, this fit is consistent with our adopted 
prior on the concentration (jj.^ = 4.6) for measuring individual 
masses of clusters in previous sections. 

Other groups have shown that WL observations can in prin¬ 
ciple achieve up to 10% precision on average concentration mea¬ 
surements for an ensemble of clusters (Okabe et al. 2013). How¬ 
ever, this precision requires fitting NFW halo models to small radii 
(typically ~ 150kpc). At these radii, the measured shear from these 
massive clusters exceeds g ~ 0.1, and often exceeds g ~ 0.3 (see 
example shear profiles in von der Linden et al. 2014a), while shear 
measurement codes have only been rigorously calibrated to shears 
of g ~ 0.05 (Bridle et al. 2010; Kitching et al. 2010; Massey et al. 
2007). Extending the shear calibration to this regime is an ongoing 
effort (LSST Dark Energy Science Collaboration 2012). 
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6 DISCUSSION 

6.1 Interpreting the weak lensing to Chandra X-ray mass 
ratio 

We have measured a WL to Chandra X-ray mass ratio of 0.956 ± 
0.082, with an additional 9% systematic uncertainty from the lens¬ 
ing analysis. Combining the statistical and systematic uncertainties 
in quadrature gives « 12% precision, making the measured mass 
ratio consistent with unity at < Icr. 

Hydrodynamic simulations have examined X-ray hydrostatic 
masses with different methodologies and baryonic physics pre¬ 
scriptions. Simulations agree that, even for the most relaxed clus¬ 
ters, hydrostatic mass estimates should slightly under-report the 
true mass within r 25 oo, with < 10% bias (Battaglia et al. 2013; Lau 
et al. 2009; Nagai et al. 2007; Rasia et al. 2012). Therefore, these 
simulations would predict a lensing to X-ray mass ratio in the range 
of 1.0 - 1.1 at these radii. Our results provide a 95% confidence up¬ 
per limit on the lensing to Chandra X-ray mass ratio of » 1.15. 
This disfavors models with stronger violations of hydrostatic equi¬ 
librium in massive, relaxed clusters than predicted by current sim¬ 
ulations. 

However, we can only establish a truly robust limit on de¬ 
partures from HSE if we also have a firm understanding of the 
systematic uncertainties in Chandra cluster mass measurements. 
In particular, the temperature calibration for in-orbit X-ray detec¬ 
tors is an area of active research by the community (see http: 
//web .mit. edu/iachec/ and the documents linked therein). A 
pessimistic bound on Chandra systematic errors would add an ad¬ 
ditional « 15% systematic uncertainty to our results, permitting 
somewhat larger departures from HSE to be present in our clus¬ 
ters. That would still disfavor a hydrostatic bias of « 20% or more, 
which is one interpretation of the results from Planck Collaboration 
etal. (2014). 

Our results also allow the possibility that the current Chandra 
calibration may bias temperatures measured from continuum emis¬ 
sion high, though we cannot place a strong constraint on the size of 
such a bias. It is well documented that a discrepancy exists between 
the XMM-Newton and Chandra temperature measurements, with 
Chandra temperatures currently being higher than XMM (Mahdavi 
et al. 2013; Nevalainen et al. 2010; Schellenberger et al. 2015). One 
examination of broad-band versus line-emission measurements of 
cluster temperatures suggests that the true temperatures may lie be¬ 
tween those inferred from fitting the continuum emission measured 
by the two telescopes (Schellenberger et al. 2012). If Chandra tem¬ 
peratures are indeed biased high, then so would be Chandra hydro¬ 
static masses. If the calibration bias exceeds the bias due to non- 
thermal support, this would result in a lensing to X-ray mass ratio 
of less than 1, consistent with our results. 

6.2 Comparison to Literature 

Other groups have previously compared lensing masses to X-ray 
hydrostatic masses measured with Chandra. In particular, Donahue 
et al. (2014) have eleven clusters in common with our extended 
sample, and most of the raw Chandra and lensing data are the same. 
Donahue et al. (2014) independently processed the Chandra data 
and adopted lensing masses from Umetsu et al. (2014). They use a 
weighted mean to measure a lensing to X-ray ratio of r = 1.14 ± 
0.09 at 0.5Mpc, approximately r 25 oo for their clusters. 

It is unclear why the results from Donahue et al. (2014) dif¬ 
fer significantly from our measurements. In part, the clusters an¬ 
alyzed in Donahue et al. (2014) are less relaxed on average than 


the clusters included in our sample, according to the measurements 
of Paper I. However, even our measurements of the “relaxed WtG 
plus marginal sample” are still in some tension with Donahue et al. 
(2014), given the nearly identical input data. The differences are 
not obviously caused by discrepant lensing analyses. Umetsu et al. 
(2014) compared their lensing masses with Weighing the Giants 
lensing masses at 1.5Mpc and found general agreement between 
the two works, with a median ratio (WtG/CLASH) of 1.02 and a 
geometric mean of 1.10. Note that taking the geometric mean as the 
actual WtG/Clash offset only exacerbates the discrepancy between 
Donahue et al. (2014) and this work. Umetsu et al. (2014) mea¬ 
sure an average concentration consistent with our assumed mass- 
concentration model, making a radius-dependent offset between 
WtG and CLASH unlikely. 

Turning to the X-ray measurements, we find some discrepan¬ 
cies between the Donahue et al. (2014) analysis and our analysis. 
For the eleven clusters in common. X-ray masses at r 25 oo in that 
work are « 20% lower than our measurements, with « 15% scatter. 
We do not expect the Chandra calibration to be an issue, as Don¬ 
ahue et al. (2014) calibrate their data with CALDB 4.5.9, which 
should be comparable to the CALDB version 4.6.1 used in our 
work. One clear methodological difference is that Donahue et al. 
(2014) bin their spectra and use fitting statistic in their anal¬ 
ysis, which is known to bias measurements of temperature when 
applied to relatively noisy data (Cash 1979). In contrast, our proce¬ 
dure minimally bins the spectra, and instead uses the C-statistic to 
correctly describe the Poisson nature of the observed counts. We es¬ 
timate that perhaps as much as half of the discrepancy between our 
X-ray masses and those of Donahue et al. (2014) could originate 
in this difference, based on fests fitting mock spectra with realistic 
backgrounds and signal-to-noise. However, a full resolution of the 
discrepancy would require more detailed comparisons. 

The works of Vikhlinin et al. (2009) and Israel et al. (2014) 
also feature WL calibrations of Chandra derived X-ray masses. 
However, no clusters are in common between this work and Is¬ 
rael et al. (2014), while only 2 clusters are in common with the 
Vikhlinin et al. (2009) calibration sample (Benson, priv comm). 
Furthermore, both works use a custom Chandra temperature cali¬ 
bration from Vikhlinin et al. (2005) that is not publicly available, 
and neither work restricts itself to relaxed clusters. 

Both Mahdavi et al. (2013) and Zhang et al. (2010) compare 
WL masses to XMM-Newton derived hydrostatic masses for sam¬ 
ples of relaxed clusters. However, few clusters are in common be¬ 
tween those samples and this work. Also, both of those works use 
lensing data, from Hoekstra et al. (2012) and Okabe et al. (2010), 
respectively, that are known to sharply disagree with Weighing the 
Giants masses. Both analyses have been revised in the interim (see 
Hoekstra et al. 2015; Okabe et al. 2013). A comparison of the mass 
ratio results would therefore be of limited value. 

Insufficient information is currently available to generalize 
measurements of the lensing to X-ray mass ratio from one X-ray 
telescope or X-ray calibration version to others. Ideally, the com¬ 
munity needs a large sample of relaxed clusters that are consis¬ 
tently analyzed with each calibration version and both Chandra 
and XMM. This would facilitate comparisons among X-ray based 
results measured at different times. 

6.3 Cosmology Dependence 

Both the X-ray and lensing masses in this analysis depend on cos¬ 
mology. The X-ray hydrostatic mass depends on the physical vol¬ 
ume filled by gas of a given temperature, resulting in a dependence 
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on the angular diameter distance D to the cluster. Lensing masses, 
on the other hand, depend on both the angular diameter distance to 
the cluster and on the angular diameter distances from the observer 
and cluster to the background lensed galaxy population. For exam¬ 
ple, the 3D lensing mass within a fixed angular aperture will scale 
approximately as 

Mi.(<R)cc = —-. (1) 

^lens-source 

Equation 1 is exact for a Single Isothermal Sphere (SIS) model 
with all sources at one redshift. The actual scaling depends on the 
type of mass calculated (SIS vs. NFW, aperture statistic vs. profile 
fit), the angular size of the measurement aperture, the functional 
form used to model the mass distribution, and the redshift distri¬ 
bution of lensed galaxies. For flat models, this scaling is roughly 
independent of but can vary by up to 10% with w, as seen 
in figure 5. Since lensing masses depend on cosmology differently 
from X-ray hydrostatic masses, the lensing to X-ray mass ratio will 
also have a cosmology dependence. To check this toy model, we 
re-evaluated the lensing to X-ray mass ratio from section 3 at four 
cosmologies, assuming flatness, that encompass the 2a allowed re¬ 
gion of the /gas experiment from Paper II, Q„, = [0.21,0.40] and 
w = [-2.0, -0.35]. The mass ratio shifted as expected for clusters 
at z ;£ 0.5, changing by less than x: 5%. 

X-ray hydrostatic masses are only one example where a cali¬ 
bration with lensing is cosmology dependent. X-ray gas mass and 
Tj mass proxies measured within a fixed aperture scale as 
Therefore the lensing calibration of these mass proxies will be even 
more sensitive to cosmology than hydrostatic masses. While this 
sensitivity to cosmological parameters is perhaps negligible for to¬ 
day’s experiments, next generation experiments and measurements 
of high redshift clusters will need to account for it. Our solution 
is to explicitly include the lensing data in the cosmological likeli¬ 
hood, by effectively fitting for lensing masses simultaneously with 
cosmology (see Section 2.4). In this way, we account for the ex¬ 
act cosmology dependence of the lensing masses and the relevant 
X-ray mass proxy. 

As an example of why one should explicitly include the lens¬ 
ing data in a cosmological likelihood, we investigate how the cos¬ 
mology dependence of the lensing to X-ray ratio manifests itself 
in the /gas test. In the context of the /gas test, the mass ratio scales 
as K(z) oc (see equation 2 from Paper II, where K(z) is 

the mass ratio). Figure 6 shows the degeneracy between the mass 
ratio and Dn, that appears in practice when fitting the full model 
of Paper II (for a flat constant-w model), reflecting the 
dependence. Neglecting the cosmology dependence of the lensing 
calibration would introduce artificial priors on cosmological con¬ 
straints from the /gas test (or any other cosmological test that uses 
lensing as a calibrator), potentially biasing results and underesti¬ 
mating uncertainties. Conversely, the cosmological model space 
explored has an effect on the inferred mass ratio, albeit negligi¬ 
ble with the current statistical uncertainties. We find that measured 
lensing to X-ray mass ratio shifts by Ar = 0.01 between a flat 
constant-w model and a ACDM model with curvature. 


6.4 Blind Analysis 

Our aim in this work is to calibrate X-ray hydrostatic mass mea¬ 
surements for relaxed clusters to ~ 10% accuracy, a level where 
“observer’s bias” must be guarded against. There is a clear expecta¬ 
tion in the literature that X-ray hydrostatic masses should be biased 




Figure 5. Plotted curves show how lensing masses in fixed apertures (yel¬ 
low) and the WL to X-ray mass ratio (hydrostatic masses in blue and gas 
mass proxies in green) scale with H,,, (top) and w (bottom) as a function 
of X-ray mass type (colors) and by cluster redshift (solid: z=0.25, dashed: 
z=1.0), as given by equation 1. All curves are normalized to n„, = 0.3, 
w = -1. Whereas the lensing to X-ray hydrostatic mass ratio does not vary 
with n„, at a level significant for this study, future work calibrating X-ray 
mass measurements for high redshift clusters will need to account for this 
dependence. 


low compared to the true mass (Nagai et al. 2007). One group pre¬ 
viously re-calibrated Chandra X-ray masses downwards by 15% to 
match these expectations (Mahdavi et al. 2013). 

To combat observer’s bias, our lensing and X-ray analyses 
were completed independently and by different members of the 
team. For the X-ray analysis, gas and total mass profiles for in¬ 
dividual clusters were blinded with random offsets until after the 
relaxed cluster sample was identified and their profiles analyzed 
(Paper II). For the lensing analysis, fit procedures and systematic 
error estimates were fixed before comparisons to literature lensing 
values occurred (Applegate et al. 2014). Comparison between lens¬ 
ing and X-ray hydrostatic masses occurred only after the sample of 
relaxed clusters was finalized. 

For full disclosure, we briefly discuss changes that occurred in 
the analysis during and after the unblinding process. Following the 
selection of relaxed clusters, but before values were unblinded, 
we computed the lensing to X-ray mass ratio for 6 clusters using the 
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Figure 6. Joint Icr and 2a confidence regions for Hn, and the leasing to X- 
ray mass ratio for the /gas measurement, marginalized over the full model 
used in Paper II (specifically for a flat, constant-w cosmology). In the /gas 
model, the mass ratio is proportional to whereas the extent of the 

allowed region is primarily set by the precision of the mass ratio measure¬ 
ment. As these results come directly from the analysis of Paper II, they use 
the May 2012 Chandra calibration (CALDB 4.4.10) and do not include the 
correction for the lensing mass model. This has a small effect on the mass 
ratio; see Sections 2.3 and 4.1. 


P(z) likelihood from Applegate et al. (2014), measuring a ratio of 
r = 0.95±0.14. Following the unblinding of /g^s values, we received 
the referee report for Applegate et al. (2014); however the report 
did not result in changes to the analysis. Upgrades to the color-cut 
analysis described in Section 2, most notably marginalizing over a 
prior on concentration appropriate for relaxed clusters, were then 
finalized before lensing and X-ray masses for the full sample of 
12 clusters were compared, resulting in a ratio of r = 0.9591^^^. 
Subsequently, we concluded that the cosmology dependence of the 
lensing to X-ray mass ratio must be included in the /gas analysis, 
resulting in the lensing likelihood for color-cut masses being incor¬ 
porated into the /gas analysis (see Section 6.3). The Chandra effec¬ 
tive area calibration was also updated, to CIAO version 4.6.1 and 
CALDB version 4.6.2 (Section 4.1). During that process, we fixed 
a bug in the importance sampling algorithm used to evaluate the 
ratio at fixed cosmology (Section 3). Immediately before submis¬ 
sion, new simulations results became available indicating that the 
lensing masses were underreported by a few percent (Section 2.3), 
after which a correction factor was introduced. The cumulative ef¬ 
fect of these changes shifted the ratio for our reference cosmology 
to 0.956 ± 0.082. 


7 SUMMARY 

In this work, we have calibrated Chandra hydrostatic masses for re¬ 
laxed clusters with accurate weak lensing measurements from the 
Weighing the Giants project (Applegate et al. 2014; Kelly et al. 
2014; von der Linden et al. 2014a). Specifically, we measured the 
combined astrophysical and instrumental bias present for the Mantz 


et al. (2014) sample. Our approach fully captures lensing and X- 
ray measurement uncertainties, correlations between the measure¬ 
ments when using a common analysis aperture, the intrinsic scat¬ 
ter between X-ray and lensing masses, and the cosmology depen¬ 
dence of both. For a fixed cosmology and a sample of 12 clusters, 
we measure a lensing to X-ray mass ratio of r = 0.96 within the 
Chandra-determined r 25 oo aperature with a statistical precision of 
9% and additional systematic uncertainty from lensing of 9%, for 
a combined precision of = 12%. These results are for the specific 
Chandra calibrations tested, and cannot be easily extrapolated to 
other versions of the Chandra calibration. We find that our results 
are robust to perturbations in the sample selection and lensing anal¬ 
ysis. 

We interpret the measured X-ray to lensing ratio as ruling out 
large departures from hydrostatic equilibrium, on the order of tens 
of percent, at the measurement radii in our relaxed target clusters. 
Since the effects of biases in the X-ray effective area calibration are 
degenerate with biases due to non-thermal pressure support, more 
robust constraints on the level of non-thermal support from lens¬ 
ing and X-ray data will not be possible without an independent 
constraint on the X-ray calibration, e.g. from precisely measured 
emission line ratios. 

By measuring the lensing to X-ray mass ratio within the cos¬ 
mological framework of Paper 11, we are able to render the results 
of that work independent of overall shifts in the Chandra temper¬ 
ature calibration. We are also able to naturally capture the cosmol¬ 
ogy dependence of the lensing to X-ray mass ratio by simultane¬ 
ously modeling the lensing signal for the 12 clusters in this sample 
with the rest of the /gas data set. The improvements to cosmological 
constraints that the lensing enables are significant, as discussed in 
detail in Paper IT 

There are clear opportunities for improvement of the lensing- 
based calibration for the /gas experiment (as well as cluster counts; 
see discussion in Mantz et al. 2015b) in the next few years. Efforts 
are under way to improve key lensing systematics, most notably 
with measuring shear (Kitching et al. 2010) and with mass mod¬ 
eling of clusters (LSST Dark Energy Science Collaboration 2012). 
With these improvements coming into place, the remaining limita¬ 
tion to an improved measurement of the lensing to X-ray mass ratio 
is the number of clusters with sufficient optical filter coverage for 
P (z) measurements, which eliminate a key systematic in the lens¬ 
ing masses used in this work (see Applegate et al. 2014). Expand¬ 
ing the sample of P (z) cluster masses from the current 6 clusters to 
~ 24 clusters would already improve the mass ratio precision, and 
thus constraints on fl„,, by 30%. 
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APPENDIX A: RATIO MODEL WITH INTRINSIC 
SCATTER AND COMMON APERTURES 

There are multiple, mathematically equivalent ways to implement 
a ratio model that includes intrinsic scatter and accounts for cor¬ 
relations induced from a shared measurement aperture. For com¬ 
pleteness, we describe here one of two ways that was used in this 
work. 

We need to evaluate the posterior probability function 
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P{r,cr[a\Ox^DL), where r is the lensing to X-ray mass ratio, trin, 
is the magnitude of intrinsic scatter present in the sample. Ox 
is the X-ray data vector, and Ol is the lensing data vector. For 
each cluster, we first constrain the parameters of an NFW mass 
model with the X-ray and lensing data independently, thereby ob¬ 
taining samples from the probability distributions for the mass 
Mj and concentration C, from each data set, P(Mjx, Ci^x\Di,x) 
and P(MjL, Note that we also include a prior on the 

concentration measured by lensing, PiCi^t), which we will ig¬ 
nore in the following description. The X-ray or lensing ^2500 mass 
within the measurement aperture is a deterministic function of the 
NFW halo parameters: Mff = Mi^x,Cix) and = 

Mjj(,Cix), respectively. The exact parameter depen¬ 
dence of each depends on the chosen measurement aperture. This 
accounts for the use of a common measurement aperture. 

The posterior probability function P(r,(Ti^i\Ox,O0 can then 
be expanded as 


P(r,o-,JO)ocY] J dM,xdC,,xdM^,^dC,,^P(r,o-,JM^lM^l) 

PiMij,, C„x|D,x)/’(M,,z., C,,dD,,L), (Al) 

where the deterministic calculations of and have been 
suppressed. In our model with log normal scatter, 

(inruAP-lnA/AP)^ 

P(r, Mf'l) = - ^——e . (A2) 

ra-midln 

We evaluate the integrals in equation Al numerically using 
Monte Carlo integration, transforming the integrals into weighted 
averages over the pre-computed samples from P(Mjx, CixlOi^x) 

andF(M,,i,C,,dD,,L)- 
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